Vision Function Layer in Multimodal LLMs
arxiv.org·1h
📊Learned Metrics
Show HN: Katakate – Self-hosted safe VMs for AI compute
github.com·17h·
Discuss: Hacker News
📦Container Security
Improving dired-show-file-type
mbork.pl·11h
📦Archive Formats
Preparing Video Data for Deep Learning: Introducing Vid Prepper
towardsdatascience.com·11h
🎞️MKV Forensics
AI Is Great at Parsing
keeb.dev·1d·
Discuss: Hacker News
🧠Learned Codecs
QuasarQ
hackster.io·1d
⚙️DIY Electronics
New to TrueNAS — 3 pools (SSD / media / NAS) or just 2? Also: bare metal or Proxmox?
reddit.com·9h·
Discuss: r/homelab
💾Proxmox Storage
How much do you really know about media queries?
frontendmasters.com·13h·
Discuss: Lobsters
🏺Media Archaeology
Meta launches Vibes, a new way of creating and remixing AI videos
techradar.com·20h
🎬WebCodecs
New obscure formats
codecs.multimedia.cx·2d
💿FLAC Archaeology
GPUs: Anatomy of high performance matmul kernels
aleksagordic.com·12h
🖥️Terminal Renaissance
Announcing Incus 6.17
stgraber.org·14h
🔓Open Source Software
Sonnet 4.5 ranks #25 (below other Claude models) in generating SQL
tinybird.co·10h·
Discuss: Hacker News
📡RSS Automation
Show HN: Wan 2.5 vs. Veo3 Who Deserves the AI Video Throne?
wan2video.com·19h·
Discuss: Hacker News
🎬WebCodecs
😮‍💨 I created my own face recognition system
dev.to·13h·
Discuss: DEV
🌀Fractal Compression
RIP Graphics (altervista.org)
kwasstuff.altervista.org·11h
📟Terminal Physics
Revisiting bsdiff as a tool for digital preservation
exponentialdecay.co.uk·1d
💿FLAC Archaeology
AISHELL6-whisper: A Chinese Mandarin Audio-visual Whisper Speech Dataset with Speech Recognition Baselines
arxiv.org·1h
🎙️Whisper
100X Faster: How We Supercharged Netflix Maestro’s Workflow Engine
netflixtechblog.com·13h·
Discuss: Hacker News
🌊Streaming Systems
Sguaba: Type-safe spatial math in Rust
youtube.com·15h
🦀Rust Borrowing